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SUBSTITUTE SPECIFICATION 

PATTERN INSPECTION METHOD AND ITS APPARATUS 

BACKGROUND OF THE INVENT ION 
The present invention relates to an inspection method and apparatus for 
5 comparing an image of an object, which is obtained by using light, laser beams, 
or the like, and a reference image to detect a fine pattern defect, a foreign body, 
or the like from a difference between the images. In particular, the present 
invention relates to a pattern inspection apparatus that is designed preferably for 
performing visual inspection of a semiconductor wafer, a TFT, a photomask, and 
10 the like, and a method therefor. 

As an example of a conventional technique for comparing an inspection 
object image and a reference image to detect a defect, reference is made to a 
method described in JP-A-05-264467. This method involves sequentially 
sensing images of an inspection object specimen using a line sensor, in which 
15 repetitive patterns are arranged regularly, comparing the sensed images with 
images delayed by a time for establishing a repetitive pattern pitch, and 
detecting a non-coincident part of the images as a pattern defect. 

Such a conventional inspection method will be described in conjunction 
with the visual inspection of a semiconductor wafer, as an example. In a 
20 semiconductor wafer which serves as an object of inspection, as shown in Fig. 6, 
a large number of chips having an identical pattern are arranged regularly. As 
shown in Fig. 7, each chip can be roughly divided into a memory mat section 71 
and a peripheral circuit section 72. The memory mat section 71 consists of a set 
of small repetitive patterns (cells), and the peripheral circuit section 72 basically 
25 consists of a set of random patterns. In general, in the memory mat section 71 , 
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the pattern density is high, and an image thereof obtained by a bright-field 
illumination optical system tends to be dark. On the other hand, in the peripheral 
circuit section 72, the pattern density is low, and an image thereof tends to be 
bright. 

5 In the conventional visual inspection, images in the same positions of two 

chips adjacent to each other, for example, an area 61, an area 62, and the like, 
as seen in Fig. 6, are compared, and a difference between the images is 
detected as a defect. In this case, since there may be a vibration of the stage 
which holds the object, inclination of the object, or the like, the positions of the 

10 two images do not always coincide with each other. Thus, the amount of 

positional deviation of the image sensed by the sensor and the image delayed by 
the repetitive pattern pitch is determined, the two images are aligned on the 
basis of the amount of positional deviation, and then a difference between the 
images is calculated. When the difference is larger than a specified threshold 

15 value, it is judged that there is a defect in the pattern; and, when the difference is 
smaller than the threshold value, it is judged that there is no defect in the pattern. 

In the alignment of two images in the comparative inspection, it is a 
general practice to set the edge parts in the images as one piece of information 
for calculation of the amount of positional deviation and to calculate the amount 

20 of positional deviation such that deviation of the edge parts between the images 
is minimized. Actually, a method has been proposed using normalized cross 
correlation, as well as a method using a sum of residuals, and the like. However, 
in any of such methods, since the amount of calculation is enormous, in order to 
realize speedup of the inspection, various measures have been required, such 

25 as changing the positional deviation calculation section to hardware or 

increasing the number of arithmetic operation circuits, and change of the image 
processing algorithm, such as simplification of the calculation of the amount of 
positional deviation. 
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In addition, in a semiconductor wafer which serves as the object of 
inspection, a slight difference in thickness occurs in a pattern due to 
planarization by CMP or the like, and so there is a difference in the brightness 
locally in images among chips. For example, reference numeral 41 in Fig. 4A 
5 denotes an example of an inspection object image, and reference numeral 42 in 
Fig. 4B denotes an example of a reference image. As indicated by the pattern 4a 
in Fig. 4A and the pattern 4b in Fig. 4B, a difference in the brightness occurs in 
an identical pattern of the inspection object image and the reference image. 

In addition, there is a defect 4d in the inspection object image 41 of Fig. 

10 4A. A difference image in this case is as shown in Fig. 4C. The difference image 
is an image represented by a density difference according to a differential value 
in corresponding positions of an inspection object image and a reference image. 
A waveform of a differential value in position 1D-1D' is as shown in Fig. 4D. 
With respect to such an image, if a part where the differential value is equal to or 

15 more than a specific threshold value TN is regarded as a defect, as in the 

conventional system, a differential value 4c of the patterns 4a and 4b, which are 
different in brightness, is detected as a defect. However, this condition should 
not originally be detected as a defect. In other words, this is a false defect or a 
nuisance defect (hereinafter referred to as a false defect). Conventionally, as 

20 one method of avoiding the occurrence of a false defect, such as indicated by 
the differential value 4c of Fig. 4C, the threshold value TN is increased (from TH 
to TH2 in Fig. 4D). However, this leads to a decrease in the sensitivity, and the 
defect 4d with a differential value of the same or lower level cannot be detected. 
In addition, a difference in the brightness due to a difference in the 

25 thickness may occur only among specific chips within a wafer such as shown in 
Fig. 6, or it may occur only in a specific pattern within a chip. However, if the 
threshold value TH is adjusted to these local areas, the overall inspection 
sensitivity will be extremely decreased. 
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SUMMARY OF THE INVENTION 
The present invention is directed to a pattern inspection apparatus for 
comparing images of corresponding areas of two patterns, which are formed so 
5 as to be identical, to judge that a non-coincident part of the images is a defect, 
and serves to reduce the influence of unevenness in brightness of patterns 
caused by a difference in the thickness or the like, so as to realize a highly 
sensitive pattern inspection. In addition, the present invention realizes high- 
speed pattern inspection without the need for changing the image comparison 
10 algorithm. Consequently, the problems of the conventional inspection technique 
can be solved. 

In other words, in accordance with the present invention, a pattern 
inspection apparatus for comparing images of corresponding areas of two 
patterns, which are formed so as to be identical, so as to judge that a non- 
15 coincident part of the images is a defect, includes means for performing 
processing for detecting an image with an image sensor, processing for 
comparing images, and processing forjudging a defect at different times. As a 
specific example of such means, the pattern inspection apparatus includes 
means for performing processing for comparing images in parallel in plural 
20 areas. Therefore, the pattern inspection apparatus includes plural units for 

performing processing for comparing images and a number of memories equal 
to or less than the number of units. In addition, the pattern inspection apparatus 
includes means for writing data in a memory of the image sensor and reading 
out data from the memory at different times. 
25 Further, the pattern inspection apparatus includes means for converting 

gradation of an image signal among compared images by different plural 
processing units. Consequently, even in the case in which an object of 
inspection is a semiconductor wafer and a difference in brightness occurs in an 
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identical pattern among images due to a difference in thickness in the wafer, or 
where there is a fluctuation in the quantity of illumination light and a fluctuation in 
the sensitivity for each pixel of the image sensor, or unevenness in the quantity 
of light accumulation time, a defect can be detected correctly. 
5 These and other objects, features and advantages of the invention will be 

apparent from the following more particular description of preferred 
embodiments of the invention, as illustrated in the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
10 In the accompanying drawings: 

Fig. 1 is a block diagram showing an example of the structure of an 
inspection apparatus according to the present invention; 

Fig. 2 is an image diagram showing an example of the flow of image data 
in the course of parallel processing by two image comparison processing units; 
15 Fig. 3 is an image diagram showing another example of the flow of image 

data in the course of parallel processing by two image comparison processing 
units; 

Fig. 4A is a diagram showing an inspection object image at the time when 
there is unevenness in brightness among compared chips in an example of a 
20 conventional threshold value setting method; 

Fig. 4B is a diagram showing a reference image in the conventional 
threshold value setting method; 

Fig. 4C is a difference image in the conventional the threshold value 
setting method; 

25 Fig. 4D is a waveform diagram of a differential value in position 1 D-1 D'; 

Fig. 5 is a plan view of a semiconductor wafer and shows an enlarged 
view of an example of an image comparison processing unit; 

Fig. 6 is a plan view of a semiconductor wafer and shows an enlarged 
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view of a chip; 

Fig. 7 is a plan view showing an example of the structure of a chip; 

Fig. 8 is a flow diagram showing a flow of processing for adjusting a 
difference in brightness which occurs in a belt shape; 
5 Fig. 9 is a flow diagram showing a flow of processing in the image 

comparison processing unit; 

Fig. 1 0A is a diagram illustrating processing for adjusting a difference in 
brightness which occurs in a belt shape and shows an example of adjusting the 
brightness in a row direction; 
10 Fig. 1 0B is a diagram illustrating processing for adjusting a difference in 

brightness which occurs in a belt shape and shows an example of adjusting the 
brightness in a column direction; 

Fig. 10C is a diagram illustrating processing for adjusting a difference in 
brightness which occurs in a belt shape and shows an example of adjusting the 
15 brightness in a row direction and a column direction; 

Fig. 1 1 A is a diagram showing an example of a result of processing by a 
brightness correction unit and shows a difference image after alignment; 

Fig. 1 1 B a diagram showing an example of a result of processing by the 
brightness correction unit and shows an image in which a difference in 
20 brightness is corrected at a high frequency (by a unit of linear or belt-like area); 

Fig. 1 1C a diagram showing an example of a result of processing by the 
brightness correction unit and shows an image in which a difference in 
brightness is corrected at a low frequency (by a unit of a wide area); 

Fig. 1 1 D is a diagram showing an example of a result of processing of the 
25 brightness correction unit and shows a luminance waveform diagram along line 
1D-1D' in Fig. 11 A; 

Fig. 1 1 E is a diagram showing an example of a result of processing by the 
brightness correction unit and shows a luminance waveform diagram of Fig. 1 1C 
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corresponding to line 1D-1D' of Fig. 11 A; 

Fig. 12 shows an example of a proper threshold value according to 
coordinates in a wafer and coordinates in a chip in a plan view of a 
semiconductor wafer; 
5 Fig. 13 is a flow diagram showing a flow of processing for setting a 

threshold value for each of the coordinates in a wafer and the coordinates in a 
chip; 

Fig. 14 is a plan view of a semiconductor wafer showing an example of 
inspection chip setting at the time of trial inspection; 
10 Fig. 15 is a front view of a displayed screen showing an example of 

superimposed indication of design data of a chip and processing results; 

Fig. 16A is a graph showing an example of setting a threshold value from 
characteristic amounts of a defect and a false defect and shows a relation 
between a density difference value and the frequency of the defect and the false 
15 defect; 

Fig. 16B is a graph showing an example of setting a threshold value from 
characteristic amounts of a defect and a false defect and shows a relation 
between an area and a density difference value of the defect and the false 
defect; 

20 Fig. 17 is a flow diagram showing a flow of processing for adjusting a 

difference of brightness which occurs at random; 

Fig. 18A is a diagram showing a reference image, a detected image, and 
a difference image for which the brightness by a unit of belt-like area is adjusted; 
Fig. 18B is a graph showing a characteristic amount space which is 
25 formed by the reference image and the detected image; 

Fig. 18C is a scatter diagram in a characteristic amount space with a 
brightness of the detected image plotted on the X axis and a brightness of the 
reference image plotted on the Y axis; 



7 



Fig. 18D shows scatter diagrams in which the scatter diagram of Fig. 18C 
is divided for each characteristic amount; 

Figs. 19A and 19B are diagrams showing an example of a segment 
dividing method for a characteristic space; and 
5 Fig. 20 is a flow diagram showing a processing procedure for calculating a 

divided threshold value for the segment division. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 
An embodiment of the present invention will be described in detail with 

10 reference to Figs. 1 to 20. 

As an example of the present invention, a defect inspection method 
employed in an optical visual inspection apparatus targeting a semiconductor 
wafer will be considered. Fig. 1 shows an example of the structure of the 
apparatus. Reference numeral 1 1 denotes a specimen (an object to be 

15 inspected, such as a semiconductor wafer). The apparatus includes a stage 12 
on which the specimen 1 1 is placed and moved; and a detection unit 13, 
including a light source 101, which illuminates the specimen 11, an illumination 
optical system 102, which condenses light emitted from the light source 101, an 
object lens 103, which illuminates the specimen 11 with the illumination light 

20 condensed by the illumination optical system 102 and focuses an optical image 
obtained by reflection of the illumination light, an image sensor 104, which 
receives the focused optical image and converts the focused optical image into 
an image signal corresponding to a value of brightness, and an A/D conversion 
unit 105, which converts an input signal received from the image sensor 104 into 

25 a digital signal. 

Here, a lamp is used as the light source 101 in the example shown in Fig. 
1. However, a laser may be used. In addition, light emitted from the light source 
101 may be light of a short wavelength, or it may be light of a wavelength in a 



wide band (white light). In the case in which light having a short wavelength is 
used, in order to increase the resolution of an image to be detected (to detect a 
fine defect), light having a wavelength in the ultraviolet region (ultraviolet light: 
UV light) may be used. 
5 In addition, it is possible to detect a defect at a relatively high speed and 

with a high sensitivity by adopting a time delay integration image sensor (TDI 
image sensor), which is constituted by arranging plural one-dimensional image 
sensors two-dimensionally, as the image sensor 104, to transfer a signal 
detected by each one-dimensional image sensor to the one-dimensional image 

10 sensor in the next stage in synchronization with movement of the stage 13 and 
to add up the signals. * 

The apparatus also includes an image editing unit 14, which includes a 
pre-processing section 106, which applies image correction, such as shading 
correction and dark level correction, to a digital signal of an image detected in 

15 the detection unit 13, and image memories 107-1 and 107-2 in which digital 
signals of a detected image to be an object of comparison and a reference 
image are stored. 

The apparatus further includes an image comparison processing unit 
which calculates defect candidates in a wafer used as a specimen. The 

20 apparatus uses plural image comparison processing units (15-1, 15-2). The 

image comparison processing unit 15-1 compares two images (a detected image 
and a reference image) stored in an image memory 107-1 of the image editing 
unit 14 and judges that a part where a differential value is larger than a threshold 
value is a defect. First, the image comparison processing unit 15-1 reads out 

25 digital signals of the detected image and the reference image stored in the image 
memory 107-1, calculates an amount of positional deviation for adjusting the 
deviation of positions in a positional deviation detection section 108-1, and 
calculates an amount of signal correction for adjusting the deviation of the 



brightness between the detected image and the reference image in a brightness 
correction section 109-1. Then, the image comparison processing unit 15-1 
compares corresponding positions of the detected image and the reference 
image in brightness in an image comparison section 110-1 using the calculated 
5 amount of positional deviation and amount of signal correction, and identifies a 
part where a differential value is larger than a specific threshold value as a 
defect candidate. A threshold value setting section 111-1 sets a threshold value, 
which is used in extracting a defect candidate from a differential value for each 
area and gives the threshold value to the image comparison section 110-1. 

10 In the image comparison processing unit 15-2, processing is performed 

according to the same procedure as the image comparison processing unit 15-1 
using two images stored in an image memory 107-2. 

The apparatus has an overall control unit 16, including a user interface 
section 112 which provides display means and input means for receiving a 

15 change of inspection parameters (a threshold value and the like used in image 
comparison) from a user and for displaying information on a detected defect, a 
storage 113 which stores a characteristic amount of a detected defect candidate, 
an image, and the like, and a CPU (incorporated in the overall control unit 16) 
which performs various controls. This is a mechanical controller 114 which 

20 drives the stage 12 on the basis of a control command received from the overall 
control unit 16. Note that the image comparison processing units 15-1 and 15-2, 
the detection unit 13, and the like are also driven according to a command from 
the overall control unit 16. 

In the semiconductor wafer 1 1 which serves as an object of inspection, a 

25 large number of chips of an identical pattern are arranged regularly as shown in 
Fig. 6. In the inspection apparatus of Fig. 1, the semiconductor wafer 1 1 is 
continuously moved by the stage 12 in response to the overall control unit 16, 
and in synchronization with this, images of the chips are sequentially captured 



from the detection unit 13, digital image signals of the same positions of two 
adjacent chips, for example, the area 61 and the area 62 in Fig. 6, are compared 
as a detected image and a reference image, respectively, according to the 
above-mentioned procedure, and a difference between the images is detected 
5 as a defect. 

Here, the inspection apparatus of this example has the two image 
comparison processing units 15-1 and 15-2 and performs processing in parallel. 
In addition, the image memories 107-1 and 107-2 also correspond to the 
respective image comparison processing units. Fig. 2 is a diagram showing the 

10 flow of processing. Here, a case in which seven chips, namely, chips 1 to 7, are 
inspected will be described. 

First, the image sensor 104 sequentially captures images of the chips. In 
association with the image capturing speed of the sensor, the image editing unit 
14 performs pre-processing of the chip images in the pre-processing section 

15 106, and the chip images are written in a memory 107. In this case, the image of 
the chip I is written in the memory 10 7-1 for the image comparison processing 
unit 15-1, the image of the chip 7 is written in the memory 10 7-2 for the image 
comparison processing unit 15-2, and the images of the chips 2 to 6 are written 
in both the memories 107-1 and 107-2. On the other hand, the image 

20 comparison processing units 15-1 and 15-2 read out images from the memories 
107-1 and 107-2, respectively, at a speed which is half the writing speed, and 
when two images are readout, image comparison is performed, such as 
calculation of an amount of positional deviation, calculation of an amount of 
signal correction, and extraction of defect candidates in parallel with the reading- 

25 out of images. The processing speed of the image comparison also is half the 
image capturing speed of the sensor. The image comparison processing units 
15-1 and 15-2 mutually perform inspection of the chips of the other unit in 
parallel. 



Fig. 3 is a diagram showing the flow of another kind of processing. Here, 
in the time for capturing an image of one chip, the two image comparison 
processing units 15-1 and 15-2 perform image comparison of an area for a half 
chip, respectively, (e.g., with respect to a length L of the chip, the image 
5 comparison processing unit 15-1 performs image comparison of a former half L/2 
and the image comparison processing unit 15-2 performs image comparison of a 
latter half L/2). First, the image sensor 104 sequentially captures the images of 
the chips. In association with the image capturing speed of the sensor, the 
image editing unit 14 performs pre-processing of the chip images, and it writes 

10 the image of the former half L/2 of each chip in the memory 107-1 for the image 
comparison processing unit 15-1 and writes the image of the latter half L/2 of 
each chip in the memory 107-2 for the image comparison processing unit 15-2. 
On the other hand, the image comparison processing units 15-1 and 15-2 read 
out images at a speed which is half the writing speed, respectively, and when 

15 images for the two chips are read out, image comparison is performed, such as 
calculation of an amount of positional deviation, calculation of an amount of 
signal correction, and extraction of a defect candidate in parallel with the 
reading-out of the images. As shown in Fig. 3, the processing speed for the 
image comparison is also half the image capturing speed of the sensor. While an 

20 image for one chip is captured, the image comparison processing units 15-1 and 
15-2 perform inspection for a half length of the chip, respectively, in parallel. 

As described above, in a case in which the speed of the detection 
processing of an image and the speed of the comparison processing are 
different, in other words, the inspection speed corresponding to the capturing 

25 speed of an image sensor can be realized by causing plural image comparison 
processing units to perform processing in parallel, even if the processing speed 
of the image comparison processing unit 15 is half the image capturing speed for 
an object chip by the image sensor 104 and the image editing speed. For 



example, if the image capturing speed of the image sensor is 1.6 Gpps (pps: 
pixel per second) at the fastest, when it is converted from a relation of an amount 
of acquired light or the like, even if the processing capability of the image 
comparison processing unit 15 is only 0.8 Gpps, which is half the image 
5 capturing speed, it becomes possible to realize an inspection processing speed 
of 1 .6 Gpps by adopting this constitution. Naturally, even in the case in which 
the speed of the image sensor is higher, it is possible to cope with this speed by 
constituting the image comparison processing unit 15 described above with three 
or more image comparison processing units to cause them to process the 

10 acquired image signals in parallel. In addition, it is also possible to cope with a 
case in which an image capturing range of the image sensor is increased. 

In the above-mentioned embodiment, a case in which the image sensor 
104 has a single output is described. However, even if the image sensor 104 is 
of a type provided with plural output terminals and outputs plural signals in 

15 parallel, signals can be processed in the same manner as the above-mentioned 
embodiment to perform image processing at a higher speed. In this case, plural 
signal lines are led out from the image sensor 104, and the plural signal lines are 
connected to plural A/D converters 105 corresponding thereto, respectively, and 
outputs from the plural A/D converters 105 are inputted to the image editing unit 

20 14 and are processed in the procedure as described above. 

Next, the processing in the image comparison processing units 15-1 and 
15-2 will be described in detail. First, the detected image signal and the 
reference image signal to be continuously inputted to the memory 107 are read 
out in synchronization with the movement of the stage. Since these image 

25 signals of two chips are not signals that relate completely to the same position if 
the stage is vibrated or a wafer set on the stage is inclined, the amount of 
positional deviation between the two images is calculated in the positional 
deviation detection section 108. The calculation of the amount of positional 



deviation is sequentially performed with a specific length in the traveling direction 
of the stage as one processing unit. Reference numeral 51, 52, ... in Fig. 5 
denote respective processing areas in the case in which a length D (pixel) is 
treated as one processing unit. This unit processing area will be hereinafter 
5 described as a unit. 

In this way, the amount of positional deviation is sequentially calculated 
unit by unit with respect to an image to be inputted, for example, the amount of 
positional deviation is calculated in the unit 51 and a unit of an adjacent chip 
corresponding to the unit 51, and then the amount of positional deviation is 

10 calculated in the unit 52 and a unit of an adjacent chip corresponding to the unit 
52. For the calculation of an amount of positional deviation, there are various 
methods using normalized cross correlation among images, a sum of density 
differences among images, a sum of squares of density differences among 
images, and the like. Any of such methods may be used. Further, the alignment 

15 of two images is performed unit by unit on the basis of the calculated amount of 
positional deviation. 

Here, in order to perform highly accurate alignment with respect to an 
amount of positional deviation calculated unit by unit, in accordance with the 
present invention, the reliability of the amount of positional deviation calculated 

20 in each unit is evaluated (e.g., in the case in which an amount of positional 

deviation is calculated according to normalized cross correlation, a magnitude of 
a correlation coefficient). In a unit with low reliability (e.g., small correlation 
coefficient), the amount of positional deviation calculated in the unit is not used, 
and the amount of positional deviation is calculated from an amount of positional 

25 deviation calculated in a unit with higher reliability before or after (in terms of 
time) the unit by interpolation or extrapolation. Consequently, it becomes 
possible to perform alignment following distortion of an image due to vibration of 
the stage even in the case in which there is only a little pattern information for 



calculation of the amount of positional deviation of the unit. 

On the other hand, in the case in which plural output signal lines are 
connected to plural A/D converters 105 from the image sensor 104, and outputs 
from the plural A/D converters 105 are inputted to the image processing unit 14, 
5 the calculation of an amount of positional deviation unit by unit is also performed 
in parallel, and the alignment is also performed in parallel. A unit of an area 
which is divided in a direction perpendicular to a traveling direction of the stage 
and in which parallel processing is performed is hereinafter described as a 
channel. Here, in order to perform highly accurate alignment by a unit of a 

10 channel as well following distortion, such as vibration of the stage and fluctuation 
in magnification of an optical system, it is also possible to extract an amount of 
positional deviation with high reliability (e.g., with a high correlation coefficient) 
from plural amounts of positional deviation calculated for each channel, subject 
the extracted plural pieces of positional deviation information to processing, such 

15 as compilation and statistical processing, to calculate an amount of positional 
deviation of a channel with low reliability. 

As an example of the calculation, one amount of positional deviation of a 
channel with highest reliability among all channels is found and is set as an 
amount of positional deviation of a channel with low reliability. Alternatively, 

20 plural pieces of positional deviation information with high reliability are subjected 
to processing, such as compilation and statistical processing, to uniquely 
calculate an amount of positional deviation common to all the channels to set the 
amount of positional deviation as an amount of positional deviation for each 
channel. Alternatively, it is also possible to interpolate an amount of positional 

25 deviation of a channel with low reliability from amounts of positional deviation of 
plural channels with high reliability by interpolation or extrapolation. The 
interpolation or extrapolation may be performed by linear interpolation, or 
distortion due to vibration or the like may be represented by curve 



approximation, such as spline approximation. Consequently, it becomes possible 
to perform alignment following distortion of an image due to vibration of the stage 
or the like, even in the case in which there is only a little pattern information for 
calculation of an amount of positional deviation of a channel. 
5 Next, the amount of signal correction for adjusting for a deviation in 

brightness in the brightness correction section 109 is calculated for the aligned 
two images. As a cause of the deviation of brightness, there are factors such as 
(1) slight difference in the thickness among chips of a semiconductor wafer 
serving as an object to be inspected, (2) a difference in the sensitivity of each 

10 pixel of an image sensor, (3) a difference in the amount of accumulated light due 
to unevenness of the speed of movement of a stage, and (4) fluctuation in an 
amount of illumination light. Among the factors, whereas factor (1) occurs at 
random depending upon a pattern of the semiconductor wafer, factors (2), (3), 
and (4) occur linearly or in a belt shape on a detected image as characteristics 

15 inherent in the inspection apparatus. 

Thus, in accordance with the present invention, first, a difference in the 
brightness, which occurs in a belt shape as in factors (2), (3), and (4), is adjusted 
at a specific high frequency, and a difference in the brightness, which occurs at 
random as in factor (1), is adjusted at a period different from the period (i.e., in a 

20 different processing area). 

Fig. 8 shows a flow of processing for the adjusting a difference in the 
brightness which occurs linearly or in a belt shape. First, with respect to a 
detected image or a reference image for which alignment has been performed, 
the amount of correction required for adjusting for an unevenness in the 

25 brightness (unevenness of brightness by a unit of row), which occurs in a 

direction perpendicular to the traveling direction of the stage, is calculated (step 
81 in Fig. 8). 

An example of a method of calculating the amount of correction of 
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brightness at a highest frequency, that is, for each row, will be described below. 
First, as shown in Fig. 10A, with respect to an object row (M pixels x one row), 
an amount of correction for adjusting the brightness is calculated using a value 
of N rows (N pixels x N rows) in the vicinity including the row. As an example of a 
5 method of calculating the amount of correction, a statistical amount is calculated 
as indicated by expression (1) below. 

10 E G = YYG(i,j) 

1U ° (NxM) u m 
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_L y y } _ )2 ...expression (1) 



Here, F(i, j) and G(i j) indicate the brightness of a detected image and a 
reference image after alignment in a position (i, j). Then, an amount of correction 
is calculated as indicated by expression (2) below. 
20 gain x =cjF/aG 

offsetx=Ergain-Eo ...expression (2) 

Correction of the object row is performed with respect to the reference 
image as indicated by expression (3) below. 

G x (i,j)=gain x G(i,j)+offset x ...expression (3) 

25 Similarly, an amount of correction for adjusting unevenness in the 

brightness (unevenness of brightness by a unit of column), which occurs in 
parallel with the traveling direction of the stage, is calculated (step 82 in Fig. 8). 
First, as shown in Fig. 10B, an amount of correction for adjusting the brightness 
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is calculated with respect to an object column (one column x L pixels) using a 
value of N columns (N columns x L pixels) in the vicinity including the column, 
and correction of the object column is performed with respect to the reference 
image as indicated by expression (4) below. 
5 Gy(ij)=gain y G(i,j)+offset x ...expression (4) 

Then, as shown in Fig. 10C a logical product is found to calculate an 
amount of correction of a brightness of each pixel which occurs linearly or in a 
belt shape (step 83 in Fig. 8). 

If (F(i,j)-Gx(i,j))<(F(i,j)-Gy(i,j)) - Amount of correction = gainx, offsetx 
10 Otherwise — Amount of correction = gainy, offsety 

Next, an amount of correction for adjusting for a difference in the 
brightness, which occurs at random depending upon the pattern of a 
semiconductor wafer, is calculated at a frequency different from the amount of 
correction for the linear difference of brightness, that is, for each area of a 
15 different size. 

As shown in the final difference image shown in Fig. 10C, a belt-like 
difference of brightness (noise), which occurs vertically or horizontally in a 
traveling direction of the stage, is reduced by finding a logical product of the 
respective pixels. However, the difference in the brightness which occurs at 
20 random remains without being corrected. Consequently, in accordance with the 
present invention, adjustment of the brightness targeting the random noise, 
which remains in this way, is performed next. An example of a procedure 
therefor is shown in Figs. 17 and 18A to 18C. 

First, as shown in Fig. 18A to 18C, with respect to a detected image and a 
25 reference image for which a belt-like brightness is adjusted, a characteristic 
amount of each pixel in a specific area is calculated to form a characteristic 
space of two-dimensions or more, as shown in Fig. 18A (17-1). The 
characteristic amount maybe any amount as long as it indicates a characteristic 



of each pixel such as the contrast, the brightness, a second deviation value, a 
density difference between corresponding pixels, or a variance value using a 
near-field pixel. Next, the characteristic space is divided into plural segments 
(17-2), and an amount of correction is calculated for each segment using a 
5 statistical amount of pixels belonging to the segment (17-3). This is equivalent to, 
with respect to each pixel in areas of the detected image and the reference 
image, resolving a scatter diagram 18B according to pixels in the areas, in which 
a brightness of the detected image is plotted on the X axis and a brightness of 
the reference image is plotted on the Y axis, as shown in Figs. 18C and 18D, 

10 according to the characteristic amount, and an amount of correction is calculated 
in each scatter diagram. 

Figs. 19A and 19B show an example of a division method for dividing the 
characteristic space of Fig. 18A into segments. In accordance with the present 
invention, the segment division is automatically performed according to an object 

15 image. The upper graph in Fig. 19A shows an example of a characteristic space 
according to a brightness and a density difference, and the lower graph is a 
histogram showing the frequency of each brightness (luminance value), which is 
an example of determining a division threshold value in a brightness direction 
from a histogram of brightnesses of an object image. 

20 Fig. 20 is a flow diagram which shows an example of a processing 

procedure therefor. First, a histogram of luminance values in an object area is 
calculated (20-b). This may be calculated from the detected image or the 
reference image, or it may be calculated from an average value of two images. 
Next, the luminance histogram is smoothed to remove a small peak or the like 

25 (20-2), and a differential value of the smoothed histogram is calculated (20-3). 
Next, the differential value is checked from the side of lower brightness, and a 
luminance value at which the value changes to a positive value is set as a Start 
and a luminance value at which the value changes to negative value next is set 
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as an End (20-4). Then, a luminance value at which the differential value is 
maximized in a range from Start to End is set as a threshold value for division 

(20-5). 

Consequently, as shown in Fig. 19A, the division is performed in the part 
5 of the trough of the histogram. This indicates that the segment division is 
performed according to a pattern in the object area. It is possible to divide a 
luminance value according to the pattern in the image. It is also possible to 
divide a luminance value according to a fixed value set by a user as shown in 
Fig. 19B. 

10 For the amount of correction of each scatter diagram resolved by the 

segment division, a linear equation is found by a least square approximation 
within the scatter diagram, as shown in Figs. 18C and 18D, and an inclination 
and a y intercept are set as an amount of correction. In addition, an amount of 
correction may be found from pixels belonging to each segment, as shown in the 

15 above-mentioned expression 1 and expression 2. Further, an area for forming a 
characteristic space can be set as an arbitrary area of pixels of 1 x 1 or more. 
However, since a defect is also adjusted if correction is performed with a 1 x 1 
pixel of a highest frequency, the area is set so as to be slightly larger. 

A result of the correction is shown in Figs. 1 1 A to 1 1 E. Fig. 1 1 A shows a 

20 difference image after alignment. A defect is in the encircled part. Fig. 1 1 D 

shows a luminance waveform of two images after alignment of 1D-1D' including 
the defect part and a differential value at that point. The detected image has a 
defect, and the defect part is brighter than the part around it. However, the 
reference image is generally bright, and a differential value of the defect part is 

25 smaller than the part around it. On the other hand, Fig. 1 1B shows a difference 
image after correcting a difference of linear brightness at a high frequency. Belt- 
like unevenness of brightness is corrected. As a result, the luminance value is 
adjusted, as shown in Fig. 1 1E, and the defect is made manifest. However, this 
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depends upon a repetitive pattern, and a difference in the brightness which 
occurs at random is not corrected. 

Fig. 1 1C shows a difference image after calculating an amount of 
correction from a statistic amount of an area (in this example, the entire area of 
5 the image of Fig. 1 1 B), which is different from the adjustment of a linear noise, 
and correcting a difference in the brightness. In this way, the amounts of 
correction are calculated in two or more different processing areas, whereby it 
becomes possible to correct the differences in brightness which occur in different 
circumstances, and to make manifest a defect of a feeble signal, which is 
10 embedded and invisible in a strong unevenness of the brightness; and to detect 
the defect. 

In accordance with the present invention, with respect to the remaining 
random noise, adjustment of the brightness can be further performed for each 
area with similar characteristics. For example, when a characteristic space is 

15 formed in both a bright area, such as the peripheral circuit section 72 of Fig. 7, 
where a false defect easily occurs, and the memory mat section 71 , and an 
amount of correction is calculated, it is likely that a defect in the memory mat 
section 71 will be corrected. Thus, characteristic spaces are formed separately in 
the memory mat section 71 and the peripheral circuit section 72. In addition, 

20 some memory mat sections or some peripheral circuit sections are in an area 
where noise easily occurs, and other memory mat sections or other peripheral 
circuit sections are in an area where noise does not easily occur. In such a case, 
characteristic spaces are also formed separately. The areas can be divided on 
the basis of design information of chips, such as CAD data, chip layout data, or 

25 chip images, or they can be divided on the basis of a result of a test inspection. 
In addition, an area forming one characteristic space may be discrete 
(discontinuous). A method of dividing an area may be described in detail later. 
An effect of performing brightness correction of multiple stages with 
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respect to an inspection image, as described above, will be described for the 
case of inspecting a pattern, which is formed on a semiconductor wafer covered 
by an insulating film with an optical transparent and flat surface, through, for 
example, a CMP process. Images which are obtained by sensing images of the 
5 wafer after the CMP processing in the detection unit 13 are affected by 
distribution of an amount of reflected light or the like, which is caused by 
fluctuation in thickness of the insulating film in the wafer surface and the 
irregularity of a pattern in a chip, so as to be changed to images with a 
brightness that fluctuation depending upon the location on the wafer. By 

10 performing brightness correction in multiple stages at different frequencies with 
respect to an image that is subject to fluctuation in brightness according to the 
method described above, the influence of the fluctuation in brightness among the 
images can be reduced to make the defect manifest. Thus, it becomes possible 
to improve the rate of detection of a defect. 

15 In this way, after calculating the amount of correction for adjusting for a 

difference in the brightness using plural processing units, a comparison of the 
brightness in corresponding positions of two images is performed in the image 
comparison section 1 10 using an amount of positional deviation and amounts of 
signal correction which are calculated at two different periods. Then, a part 

20 where a differential value is larger than the threshold value TH is detected as a 
defect. 

To summarize the flow of this processing, as shown in Fig. 9, in the image 
comparison processing unit 15, first, an amount of positional deviation between a 
detected image and a reference image is calculated from an image signal, which 
25 is received from the image editing unit 14, in the positional deviation correction 
section 108 (1081), and positional deviation of the detected image and the 
reference image is corrected on the basis of this calculated amount of deviation 
(1082). Next, in the brightness correction section 109, an amount of correction of 



brightness unevenness between corresponding linear and belt-like areas of the 
detected image and the reference image is calculated (1091), and then an 
amount of correction of random brightness unevenness between corresponding 
areas of the detected image and the reference image is calculated for each area 
that is larger than the linear and belt-like areas (1092). 

Next, in the image comparison section 110, a differential image of the 
detected image and the reference image, which is corrected using information on 
the amount of positional deviation between the detected image and the 
reference image calculated in the positional deviation correction section 108, 
information on the brightness unevenness between the linear and belt-like areas 
calculated in the brightness correction section 109, and information on the 
amount of correction of random brightness unevenness calculated for each area 
larger than the linear and belt-like areas, is compared with the threshold value 
signal from the threshold value setting section 1 1 1 to detect a defect. 

Here, in a semiconductor wafer which serves as an object of inspection, 
depending upon the position on a chip, there is an area which is not intentionally 
detected, even if the noise is large, a false defect easily occurs, or there is a 
defect in the area, because damage due to the noise, the false defect, or the 
defect is small. For example, in Fig. 7, the memory mat section 71 consists of a 
set of small repetitive patterns (cells), and the peripheral circuit section 72 is 
basically a set of random patterns. In general, in the memory mat section 71 , the 
pattern density is high and an image obtained by a bright-field illumination optical 
system tends to be dark. On the other hand, in the peripheral circuit section 72, 
the pattern density is low and an image to be obtained tends to be bright. 
Further, in an area in which an image signal to be inputted is large (bright area), 
such as the peripheral circuit section 72, since the level of noise is large and a 
false defect easily occurs, it is necessary to set the threshold value TH slightly 
larger. On the other hand, in the memory mat section 71, since an image signal 
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to be obtained is small (the section is dark), it is necessary to set the threshold 
value TH smaller and perform inspection with high sensitivity. 

In addition, in the case in which a difference in the brightness occurs 
locally due to a difference in the thickness in a specific pattern area between 
5 chips as shown in Figs. 4A and 4B, a false defect easily occurs in this part. Thus, 
it is necessary to set the threshold value TH slightly larger. On the other hand, in 
the case in which a comparison is performed using only the threshold value TH 
for the entire area, it is inevitable to set a slightly larger threshold value which is 
suited to an area with large noise or an area where a false defect easily occurs. 

10 Thus, the sensitivity in the other areas falls. Therefore, as a method of 

performing inspection with a high sensitivity over the entire area, the present 
invention provides means for setting the threshold value according to an area. 

The characteristics 12-1 in Fig. 12 indicates a threshold value for 
preventing a false defect from occurring in a certain position in a chip, indicating 

15 that it is necessary to set different threshold values according to an area. 
Moreover, in a semiconductor wafer, a difference in the brightness between 
chips, tends to be larger in a part closer to the periphery of a wafer. The 
characteristic 12-2 in Fig. 12 indicates a threshold value for preventing a false 
defect from! occurring in a certain position in the wafer, indicating that it is 

20 necessary to set different threshold values according to the position of a chip. In 
order to realize a highly sensitive inspection over the entire area of the 
inspection area in this way, in the threshold value setting section 1 11 of Figs. 1 
and 9, a threshold value is set for each position of the chip in the wafer and each 
position in the chip. An example of a procedure for effecting such setting is 

25 shown in Fig. 13. 

First, a user sets general conditions (13-1) and performs a trial inspection 
(13-2). In this case, if only a threshold value according to a pattern in a chip is 
set, an area for one chip is set as an inspection area (14-1 in Fig. 14). If a 



threshold value according to a position of a chip in a wafer is further set, chips on 
the entire surface of the wafer, or at least from an end to an end of the wafer, are 
set as an inspection area (14-2 in Fig. 14). Then, the user visually confirms the 
presence of detected defect candidates and classifies them into real defects and 
false defects (13-3). For such confirmation, an image, which is sensed by image 
sensing means different from that used at the time of actual inspection, such as 
a high magnification camera, or an image used for the actual inspection, may be 
used. In this case, it is also possible to perform auto defect classification (ADC) 
or the like in parallel with the inspection and to automatically classify defect 
candidates into defects and false defects using a result of the auto defect 
classification or the like without intervention of the user. Then, a result of 
classification is displayed together with the chip, such that it can be seen at a 
glance where in the chip the occurrence of false defects are concentrated (13-4). 

In the case in which plural chips are set as an inspection area, the results 
of inspection in the respective chips are superimposed. Looking at the 
superimposed results, the user sets an area where the user wishes to decrease 
the sensitivity, that is, an area where the threshold value is to be increased, and 
an area where the user wishes to increase sensitivity, that is, an area where the 
threshold value is to be decreased. In this case, as shown in Fig. 15, it is also 
possible to display the areas while being superimposed on design information of 
the chip, such as CAD data, chip layout data, and a chip image. By 
superimposition of display areas on the design information, it becomes easier to 
confirm whether the set areas are appropriate and to change the set areas. In 
addition, an area can be set automatically from the design information of the 
chip. 

Next, a characteristic amount of an area for setting a threshold value for 
each set area is calculated (13-6). For example, the characteristic amount is a 
characteristic amount F(i,j)-G(i,j), an area, or the like, of an area where a false 



25 



defect occurs, such as a density difference value (a difference in brightness 
between the detected image and the reference image) of a false defect in an 
area where the occurrence of a false defect is concentrated, the pattern density, 
brightness, or the like of an area where a false defect occurs, and the like. In 
addition, in the case in which a trial inspection is performed in plural chips, the 
coordinates in a wafer of a chip where the occurrence of a false defect is 
concentrated, the shape of the occurrence of a false defect in the entire wafer, 
and the like are also calculated. Then, a threshold value, which is not a false 
defect, in each area is set from the characteristic amount in each area (13-7). 
For example, in the area where the occurrence of a false defect is concentrated, 
the detection of a false defect can be avoided if the threshold value is set to be 
larger than the density difference value of the false defect. 

In addition, as described above, in an area where the pattern density is 
high or an area where the background is dark, the threshold value is set slightly 
lower because a false defect does not easily occur. On the other hand, in an 
area where the pattern density is low or an area where the background is bright, 
the threshold value is set to be slightly higher because a false defect easily 
occurs. Further, the threshold value is set slightly higher in a chip which is 
spaced from the center of a chip or a wafer where a false defect is concentrated. 
Since information such as the pattern density and the brightness of the 
background can be obtained from an image, division of an area is performed 
automatically. 

As shown in Fig. 16A, the threshold value is basically set automatically 
from the density difference value of a false defect, but it may be set with 
reference to a threshold value set by the user in advance. In addition, as shown 
in Fig. 16B, a threshold value forjudging detection or non-detection can also be 
calculated from the density difference value and the area. Naturally, it is also 
possible for the user to set the area and the threshold value for each area 
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manually. Such automatic setting of the area and automatic threshold setting for 
each area are performed in section 1 1 1 in Fig. 1 , and only a defect is detected in 
the image comparison section 110 using the threshold value. 

Moreover, it is also possible to perform inspection with the threshold value 
determined in 13-7 in step Fig. 13 and repeat steps 13-3 to 13-7 to perform 
tuning of the parameters. In addition, in this case, steps 13-3 to 13-7 can also be 
repeated while changing the optical conditions (focal point, amount of light, 
deflection conditions, etc.) (13-8) to perform tuning of the optical conditions. 

As described above, in accordance with the present invention, image 
comparison processing is performed in parallel in the inspection for comparing 
two images and detecting a defect from a differential value thereof, whereby an 
inspection speed equivalent to or close to an image capturing speed of an image 
sensor can be realized. 

For example, if the image capturing speed of the image sensor is 1.6 
Gpps (pps: pixel per second) at the fastest, even if the processing capability of 
the image comparison processing unit 15 is only 0.8 Gpps, which is half the 
image capturing speed, it becomes possible to realize an inspection processing 
speed of 1 .6 Gpps by adopting a constitution in which two image comparison 
processing units are provided in parallel, as in the present invention. Moreover, 
even in the case in which the speed of the image sensor is higher, such as the 
case in which the image capturing speed of the image sensor becomes equal to 
or higher than 1.6 Gpps, the case in which an image accumulation time of the 
image sensor is reduced according to optical conditions, or the case in which the 
scan width to be scanned by the image sensor at one time is increased, it 
becomes possible to cope with the case without speeding up the image 
comparison processing itself by increasing the number of image comparison 
processing units to N. For example, even in the case in which the image 
capturing speed of the image sensor is further increased to 6 .4 Gpps at the 
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fastest, it is possible to cope with a case having an image processing speed of 
6.4 Gpps by arranging eight image comparison processing units 15 with a 
processing capability of 0.8 Gpps in parallel. 

In addition, by adjusting the difference in the thickness among chips, a 
difference in the sensitivity of each pixel of the image sensor, a difference in the 
an amount of accumulated light due to unevenness of the stage speed, and a 
difference in the brightness among the chips which occurs due to various factors, 
such as fluctuation in illumination (unevenness of colors) at plural different 
frequencies (i.e., an amount of correction is calculated for each of plural different 
areas), it becomes possible to manifest a defect of a feeble signal, which is 
embedded in strong brightness unevenness, and detect the defect. 

Further, by dividing an inspection area into plural areas according to 
coordinates in a wafer and coordinates in a chip to automatically set a threshold 
value for each divided area, the inspection sensitivity in each area can be 
optimized, so as to realize a highly sensitive inspection as a whole. In this case, 
by displaying design information of the chip, such as CAD data and a chip image 
of the chip so that it is superimposed on a state of occurrence of a false defect or 
a result of automatic area setting, confirmation and correction of a set area are 
facilitated. It is also possible for the user to set the threshold value manually. 

The processing of the image comparison processing units 15-1 and 15-2 
according to the present invention, as described above, is realized by software 
processing by the CPU. However, it is also possible to change a core arithmetic 
operation part, such as normalized cross correlation operation and the formation 
of a characteristic space, to hardware processing by an LSI. Consequently, 
further speed-up can be realized. In addition, even if there is a large difference in 
the brightness among compared dies due to a slight difference in the thickness 
of a pattern after a smoothing process, such as a CMP, or a reduction in the 
wavelength of the illumination light, it becomes possible to detect a defect of 20 
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nm to 90 nm according to the present invention. 

Moreover, in inspection of a low k film, for example, an inorganic 
insulating film, such as Si0 2 , SiOF, BSG, SiOB, or a porous silica film, or an 
organic insulating film, such as methyl containing Si0 2 , MSQ, a polyimide film, a 
5 parelin film, a Teflon (registered trademark) film, or an amorphous carbon film, 
even if there is a local difference in the brightness due to in-film fluctuation of a 
refractive index distribution, it becomes possible to detect a defect of 20 nm to 
90 nm according to the present invention. 

An embodiment of the present invention has been described with 

10 reference to a comparative inspection image in an optical visual inspection 

apparatus targeting a semiconductor wafer as an example. However, the present 
invention is also applicable to a comparative image in an electron beam type 
pattern inspection. In addition, the object of inspection is not limited to a 
semiconductor wafer, and the present invention is applicable to a TFT substrate, 

15 a photomask, a printed board, or the like as long as a defect is detected by 
comparison of images. 

As described above, according to the present invention, by performing 
image comparison processing in parallel, an inspection speed corresponding to 
a processing speed depending upon an image capturing speed of an image 

20 sensor, an image accumulation time, a scanning width, and the like can be 
realized. 

In addition, by adjusting, at plural different frequencies, a difference of 
brightness among images to be compared caused by a difference in thickness 
among chips, a difference in the sensitivity of each pixel of the image sensor, a 
25 difference in the amount of accumulated light due to unevenness of the stage 
speed, a difference in the brightness among the chips which occurs due to 
various factors, such as fluctuation in illumination (unevenness of colors), or the 
like, it becomes possible to manifest a defect of a feeble signal, which is 
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embedded in strong brightness unevenness, and detect the defect. 

Further, since a threshold value can be set according to coordinates in a 
wafer and coordinates in a chip, it becomes possible to automatically optimize 
the inspection sensitivity in various places, so that highly sensitive inspection can 
be realized. In this case, by displaying design information of the chip and a 
threshold setting area so as to superimpose one on top of another, adjustment of 
the sensitivity, such as confirmation and correction of the threshold value setting 
area, is facilitated. 

The invention may be embodied in other specific forms without departing 
from the spirit or essential characteristics thereof. The present embodiment is 
therefore to be considered in all respects as illustrative and not restrictive, the 
scope of the invention being indicated by the appended claims rather than by the 
foregoing description and all changes which come within the meaning and range 
of equivalency of the claims are therefore intended to be embraced therein. 
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